Rank | Count | Beginning |
---|---|---|
64807 | 1062 | 概要 |
112 | 733 | また、 |
1 | 566 | ; |
67303 | 391 | 歴史 |
8 | 297 | ) |
81776 | 277 | 経歴 |
528 | 219 | なお、 |
49351 | 213 | その後、 |
155 | 211 | しかし、 |
69315 | 208 | 沿革 |
63037 | 191 | 来歴 |
105 | 177 | 。 |
11 | 165 | 、 |
36368 | 160 | 地理 |
18613 | 131 | 人物 |
75623 | 131 | 略歴 |
19329 | 119 | その他 |
9 | 118 | 」 |
74774 | 112 | 生涯 |
194 | 106 | あらすじ |
88314 | 92 | 解説 |
18 | 89 | ストーリー |
76467 | 85 | 登場人物 |
72486 | 81 | 特徴 |
84294 | 79 | 脚注 |
115 | 72 | ただし、 |
13226 | 64 | 一方、 |
63122 | 64 | 来歴・人物 |
96006 | 59 | 関連項目 |
30618 | 58 | 参考文献 |
In the next four subsections show the most frequent sentence beginnings consisting of N words, N=1, 2, 3, 4. In this subsection we start with N=1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
Especially for N=1, we only need a small corpus to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV